Tags: vram, quantization, context length, cli, llm


  1. A Ruby script that calculates VRAM requirements for large language models (LLMs) based on the model, the quantization level in bits per weight (bpw), and the context length. It can determine the required VRAM or, given the available VRAM, the maximum context length or the best bpw.
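
The three calculations described above can be illustrated with a rough sketch. This is not the bookmarked script's code: it is written in Python rather than Ruby, and it assumes a common approximation in which VRAM is the quantized weights plus an fp16 key/value cache, ignoring activations and runtime overhead. The `ModelConfig` fields, the function names, and the candidate bpw values are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelConfig:
    """Hypothetical model description; field names are assumptions."""
    n_params: int      # total number of weights
    n_layers: int      # transformer layers
    n_kv_heads: int    # key/value heads per layer
    head_dim: int      # dimension of each head
    kv_bytes: int = 2  # bytes per cache element (assumes an fp16 cache)


def kv_bytes_per_token(cfg: ModelConfig) -> int:
    # One key and one value vector per layer per KV head.
    return 2 * cfg.n_layers * cfg.n_kv_heads * cfg.head_dim * cfg.kv_bytes


def weight_bytes(cfg: ModelConfig, bpw: float) -> float:
    # Quantized weights: parameters times bits per weight, in bytes.
    return cfg.n_params * bpw / 8


def required_vram(cfg: ModelConfig, bpw: float, context: int) -> float:
    """Estimated bytes needed for the weights plus the KV cache."""
    return weight_bytes(cfg, bpw) + kv_bytes_per_token(cfg) * context


def max_context(cfg: ModelConfig, bpw: float, vram: float) -> int:
    """Largest context (in tokens) whose KV cache fits beside the weights."""
    free = vram - weight_bytes(cfg, bpw)
    return max(0, int(free // kv_bytes_per_token(cfg)))


def best_bpw(cfg, context, vram,
             candidates=(2.0, 3.0, 4.0, 5.0, 6.0, 8.0, 16.0)):
    """Highest candidate bpw that fits, or None if none does."""
    fitting = [b for b in candidates if required_vram(cfg, b, context) <= vram]
    return max(fitting) if fitting else None


if __name__ == "__main__":
    # Illustrative 7B-parameter configuration (values are assumptions).
    cfg = ModelConfig(n_params=7_000_000_000, n_layers=32,
                      n_kv_heads=32, head_dim=128)
    gib = 1024 ** 3
    print(required_vram(cfg, 4.0, 4096) / gib)
    print(max_context(cfg, 4.0, 8 * gib))
    print(best_bpw(cfg, 4096, 8 * gib))
```

Real usage is typically higher than this estimate, because inference runtimes add buffers and per-backend overhead, so the results are best treated as a lower bound.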


SemanticScuttle - klotz.me: tagged with "vram+quantization+context length+cli+llm"
